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A METHOD AND SYSTEM FOR RESOURCE ALLOCATION 



FIELD OF THE INVENTION 

The present invention relates generally to a method and 
5 system for resource allocation. More particularly, the 
present invention allocates resources using technology 
graphs, passive and active searching, reinforcement learning, 
market driven decision making, reinforcement learning as well 
as p, tau, and patches techniques. 

10 

Background 

Modern computer networks for enterprises are 
tending towards the distributed computer system model in 
which various resources such as data, application programs, 

15 and computational power, are available at different locations 
on a network. Commercial products (e.g., Sun's Enterprise 
Java Beans, and Microsoft's COM, C0M+ and MTS systems) and 
standards (CORBA) exist that are directed at providing a 
common interface for diverse applications that will allow 

2 ^ platform- independent operation. 

For heterogenous components to work together 
successfully across an enterprise, the Object Management 
Group (OMG) adopted guidelines for how component interfaces 
are defined and components are invoked. Recently, OMG has 

25 been actively soliciting proposals from vendors for the 
development of an architecture and a series of detailed 
specifications for CORBA. The CORBA core includes 
standardized network communication protocols, interface 
mappings for various languages, interoperability, interface 

30 and implementation repositories, and calling conventions 
implemented by a CORBA Object Request Broker (ORB) . 
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An ORB is the middleware that established the 
client-server relationship between objects. Using an ORB, a 
client can invoke the method of a server object 
transparently. This server object can be on the same machine 
as the client object or across the network. The ORB 
intercepts the call and is responsible for finding an object 
that can implement the request, passing it data, invoking the 
requested operation and returning the results. 

Microsoft has been promoting its own alternative to 
CORBA for a distributed software component architecture. The 
Microsoft approach is based on Microsoft's COM. COM 
principally provides the architectural abstractions for 
Object Linking and Embedding (OLE) components. COM was based 
on several fundamental concepts: as application binary 
interface (ABI) for calling functions, a provisions for 
grouping functions into interfaces and a base interface. COM 
defined a standard for laying out virtual functional tables 
in memory and a standard for calling functions through 
pointers . 

Microsoft has developed the Microsoft Transaction 
Server (MTS) to extend COM to service other levels of 
brokering that may be required in an application. MTS is a 
component -based transaction processing systems for developing 
and deploying transaction-oriented processing on Windows NT 
Server. MTS also defines an application programming model 
for developing distributed, component -based applications. 

But each of these products is limited to providing 
a common interface for diverse application to allow platform- 
independent operation. Accordingly, there exists a need for 
algorithms that adaptively discover resource availability and 
balance the load among a set of resource providers . 
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Summary of the Invention 

The present invention is a method and system to 
adaptively discover resource availability from resource 
providers and to adaptively balance the load among the 
5 resource providers . 

The present invention includes a method for 
resource allocation comprising the steps of: 

receiving a plurality of resource requests and a 
plurality of resource offers, said resource requests and said 
10 resource offers having one or more properties; and 

determining at least one relation between at least 
one of said resource requests and at least one of said 
resource offers to identify at least one matching said 
resource offer for said resource request; and 
ig allocating said at least one resource request with 

said at least one matching resource offer. 

The present invention further includes a method for 
allocating a plurality of resources comprising the steps of: 
receiving information for at least one resource 

2Q request; 

computing an expected return for processing said 
resource request from said information; and 

processing said at least one resource request to 
optimize said expected return. 

The present invention further includes a method for 
25 y 

allocating a plurality of resources comprising the steps of: 

defining at least one algorithm having one or more 
parameters for allocating the resources; 

defining at least one global performance measure of 
said at least one algorithm; 

executing said algorithm for a plurality or 
different values of said one or more parameters to generate a 
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corresponding plurality of values for said global performance 
measure ; 

constructing a fitness landscape from said values 
of said parameters and said corresponding values of said 
5 global performance measure; and 

optimizing over said fitness landscape to generate 
optimal values for said at least one parameter. 



IQ Brief Description of Drawings 

FIG. 1 shows an illustration of the architecture of 
the system of the present invention. 

FIG. 2 provides a flow diagram describing a method 
executed by the resource providing nodes . 
15 FIG. 3 displays an exemplary technology graph. 

FIG. 4 provides a dataflow diagram 400 representing 
an overview of a method for synthesizing the technology 
graph . 

FIG. 5 provides a flow diagram for locating and 
selecting poly- functional intermediate objects for a set of 
terminal resources having a cardinality greater than or equal 
to two . 

FIG. 6 displays a flow diagram of a method for 
allocating resources using a market -based scheme which could 
also execute on a resource providing node. 

FIG. 7 provides a flow diagram for determining 
optimal values of parameters of methods performing resource 
allocation and load balancing. 

FIG. 8 discloses a representative computing system 
in conjunction with which the embodiments of the present 
invention may be implemented and executed. 



20 
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Detailed Description of the Preferred Embodiment 

The present invention allocates resources using 
technologies graphs, passive and active searching, 
5 reinforcement learning, market driven decision making, 
reinforcement learning as well as p, tau, and patches 
techniques . 

Without limitation, the following embodiments of 
the present invention are described in the illustrative 

1Q context of the allocation of resources in a distributed, 

computing system. However, it will be apparent to persons of 
ordinary skill in the art that other contexts can be used to 
embody the aspects of the present invention. These aspects, 
which are applicable in a wide range of contexts include 

jg receiving a plurality of resource requests and a plurality of 
resource offers, determining at least one relation between 
the resource requests and the resource offers to identify 
matching resource requests and offers and allocating the 
resource offers to its matching resource request. 

20 

System Architecture 

FIG. 1 shows an illustration of the architecture of 
the system of the present invention. The system includes 
resource requests 110, 112, 114, 116, which could be 
^ generated by a client application. The resource requests 
110, 112, 114, 116 have one or more properties. Exemplary 
properties for the resource requests 110, 112, 114, 116 
include identities and requirements. Exemplary properties 
for the resource requests 110, 112, 114, 116 further include 
attributes such as is-a, needs-a, has-a, uses-a, etc. 

30 

Exemplary attributes for resource requests further include 
requirements for data, software, computational or 
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communication resources at a specified level of quality of 
service . 

The system of the present invention further 
includes resource offers 13 0, 132, 134, 13 6, which could be 

5 generated by resource providers. The resource offers 130, 
132, 134, 136 have one or more properties. Exemplary 
properties for the resource offers 130, 132, 134, 136 include 
identities and abilities. Exemplary properties for the 
resource offers 130, 132, 134, 136 further include attributes 

10 such as is-a, has-a, needs-a, does-a, etc. Exemplary 

attributes for resource offers 130, 132, 134 further include 
data, software, computational or communication resource 
abilities at a specified level of quality of service. 

The system of the present invention further 

15 includes resource providing nodes (RPNs) 120, 122, 124, 12 6, 
128. The resource providing nodes 120, 122, 124, 126, 128 
communicate with resource providers 130, 132, 134, 136 and 
among themselves to maintain local databases of resource 
availability including quality of service specifications, and 
cost. In the preferred embodiment, the resource availability 
data at the resource providing nodes 12 0, 122, 124, 126, 12 8 
include different levels of quality of service for different 
costs. Preferably, RPNs 120, 122, 124, 126, 128 also make 
decisions about resource allocation on servers. These 
decisions may involve, but are not limited to, duplication of 

25 

data or application resources, or moving data, application or 
license resource from one server to another. 



Resource Allocation Method 

FIG. 2 provides a flow diagram describing a method 
200 executed by the resource providing nodes 12 0, 122, 124, 
12 6, 12 8. As is known to persons of ordinary skill in the 
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art, a dataflow diagram is a graph whose nodes are processes 
and whose arcs are dataflows. See Object Oriented Modeling 
and Design, Rumbaugh, J., Prentice Hall, Inc. (1991), Chapter 
1. In step 210, the method 2 00 receives resource requests 
110, 112, 114, 116. In step 220, the method 200 receives 
resource offers 130, 132, 134, 136 from resource providers. 
In step 230, the method 200 combines the resources offers 
130, 132, 134, 136 to form new resource offer combinations. 
In step 24 0, the method 2 00 searches for relations between 
the resource requests 110, 112, 114, 116 and the resource 
offers 110, 112, 114, 116 as well as the new resource offer 
combinations . 

In step 250, the method 200 evaluates the relations 
between the resource requests 110, 112, 114, 116 and the 
resource offers 110, 112, 114, 116 as well as the new 
resource offer combinations. In step 260, the method selects 
the relations that are optimal with respect to the evaluation 
and allocates the selected relation's resource offer to its 
corresponding resource request . 

Technology Graph. 

Execution of step 23 0 yields a technology graph 
235, which is a multigraph representation of the processes 
executed by the resource providing nodes 120,122, 124, 126 
and 128 to form combinations of resources to satisfy the 
resource requests 110, 112, 114, 116. FIG. 3 displays an 
exemplary technology graph. As is known to persons of 
ordinary skill in the art, a multigraph is a pair (V,E) where 
V is a set of vertices, E is a set of hyperedges, and E is a 
subset of P(V) , the power set of V. See Graph Theory, Bela 
Bollobas, Springer-Verlag, New York, 1979, {"Graph Theory") 
Chapter 1. The power set of V is the set of subsets of V. 
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See Introduction to Discrete Structures, Preparata and Yen, 
Addison-Wesley Publishing Company, Inc. (1973) ( "Introduction 
to Discrete Structures") , pg 216. 

In the technology graph (V,E) , each vertex v of the 
set of vertices V represents a resource. More formally, 
there exists a one-to-one correspondence between the set of 
resources and the set of vertices V in the technology graph 
(V,E) . A function denoted by g: O -> V from the set of 
resources O representing the resources to the set of vertices 
V in the corresponding multigraph (V,E) assigns the vertex v 
to a resource o (g(o) = v) . 

In the technology graph (V, E) , each hyperedge e of 
the set of hyperedges E represents a transformation as shown 
by FIG. 3. The outputs of the hyperedge e are defined as the 
intermediate resources 310 or the finished resources 315 
produced by execution of the transformation represented by 
the hyperedge e. The outputs of the hyperedge e also include 
the waste products of the transformation. The inputs of the 
hyperedge e represent the complementary resources used in the 
production of the resources of the hyperedge. Complementary 
resources are used jointly to produce other resources. 

The technology graph includes resources 305, 
intermediate resources 310, finished resources 315, and 
machines 32 0. Machines 32 0 are resources which perform 
ordered sequences of transformations on an input bundle of 
resources to produce an output bundle of resources. 
Accordingly, resources 310 are produced when machines 320 
execute their transformations on an input bundle of 
resources. A machine 320 which mediates transformations is 
represented in the technology graph H = (V, E) as an input to 
a hyperedge e. In an alternate embodiment, a machine 32 0 
which mediates transformations is represented as an object 
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which acts on the hyperedge e to execute the transformation. 
Finished resources 315 are the end products which are 
produced for the client making a resource request 110, 112, 
114, 116. 

5 The objects and transformations among the resources 

in the technology graph H = {V, E) constitute a generative 
grammar. As is known by persons of ordinary skill in the 
art, context-free grammars represent transformations or 
productions on symbol strings. Each production specifies a 

10 substitute symbol string for a given symbol string. The 
technology graph H = {V, E) extends the principles of 
context-free grammars from symbol strings and transformations 
among symbol strings to resources and transformations among 
resources. The expressiveness of the technology graph H = 

15 (V, E) is higher than that of context-free grammars as 
hypergraphs can represent multidimensional relationships 
directly. The technology graph H = {V, E) also expresses a 
context sensitive grammar. 

Each transformation in the technology graph H = ( V, 

2Q E) may specify a substitute hypergraph for a given 

hypergraph. Accordingly if a subgraph within a hypergraph 
matches a given hypergraph in a transformation, the subgraph 
is removed and replaced by the substitute hypergraph. The 
resulting hypergraph is derived from the original hypergraph. 

FIG. 4 provides a dataflow diagram 400 representing 

25 

an overview of a method for synthesizing the technology 
graph. In step 410, the technology graph synthesis method 
performs the initialization step. The technology graph 
synthesis method initializes the set of vertices V of the 
technology graph H = [V, E) to a founder set of resources. 

30 

The founder set contains the most primitive resources. Thus, 
the founder set could represent renewable resources. The 
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founder set can have from zero to a finite number of 
resources. The method also initializes a set of 
transformations, T , with a finite number of predetermined 
transformations in step 410. Finally, the method initializes 

5 an iterate identifier, i, to 0 in step 410. 

In step 415, the method determines whether the 
iterate identifier is less than a maximum iterate value, I. 
If the iterate identifier is not less than the maximum 
iterate value, I, the method terminates at step 430. If the 

10 iterate identifier is less than the maximum iterate value, I, 
then control proceeds to step 42 0. 

In step 420, the technology graph synthesis method 
applies the set of transformations, T, to the set of vertices 
V. In the first iteration of the loop of the flow diagram of 

15 FIG. 4, step 42 0 applies the set of transformations, T , to 
the resources in the founder set. First, step 42 0 applies 
each transformation in the set of transformations, T , to 
each resource in the founder set. Next, step 420 applies 
each transformation in the set of transformations, T , to all 

2Q pairs of resources in the founder set. Step 420 similarly 
continues by applying each transformation in the set of 
transformations, T , to each higher order subset of resources 
in the founder set. Execution of step 420 in iteration, i , 
yields the i th technology adjacent possible set of 
resources. Execution of step 420 in iteration, i , also 

25 

yields a modified technology graph H = {V, E) . The modified 
technology graph H = {V, E) contains additional vertices 
corresponding to the i th technology adjacent possible set of 
resources and additional hyperedges e corresponding to the 
transformations applied in the i th iteration of the loop of 

30 

the flow graph of FIG. 4. 
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In one embodiment, the method maintains all 
vertices created by execution of step 42 0 in the technology 
graph H = {V, E) . In an alternate embodiment, step 42 5 
prunes all vertices representing duplicate elements of the 
ith technology adjacent possible set of resources from the 
technology graph H = (V, E) . Accordingly, in the first 
embodiment of step 42 5, every resource constructed at each 
iteration of the method is kept in the technology graph H = 
(V, E) . Execution of the technology graph synthesis method 
400 using the first embodiment of step 425 produces a full 
technology graph H = (V, E) . In the alternate embodiment, 
only resources which were not elements in the founder set and 
which were not created in previous iterations of the loop of 
the flow diagram of FIG. 4 are added to the technology graph 
H = (V, E) . Execution of the technology graph synthesis 
method 400 using the alternate embodiment with the pruning of 
step 42 5 produces a minimal technology graph H = {V, E) . 
After execution of step 425, control returns to step 415. 

In subsequent iterations of the loop of the flow 
graph of FIG. 4, step 420 applies the set of transformations, 
T , to the resources in the set of vertices V of the 
technology graph H = (V, E) produced by the execution of the 
previous iteration of the loop. 

The set of transformations T can be held fixed 
throughout the execution of the technology graph synthesis 
method 400. Alternatively, new transformations could be 
added to the set of transformations and old transformations 
could be removed. For example, resources representing 
machines could also be included in the founder set of 
resources. Next, the set of transformations T could be 
applied to the resources representing machines just as they 
are applied to the other resources in the technology graph H 
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= {V, E) . Consequently, the set of transformations T could 
be limited to the transformations which are mediated by those 
machine resources represented by vertices of the technology 
graph H = (V, E) . 

5 

Technology Graph Applications 

The paths in the technology graph H = (V, E) which 
begin at vertices corresponding to resources in the founder 
set and end at vertices corresponding to finished goods 

10 represent the processes for producing the finished resources 
from the resources in the founder set. A path Pi of a 
hypergraph H - (V, E) is defined as an alternating sequence 
of vertices and edges v llt e ilr v i2i e i2i v i3r e l3 ,v i4/ e i4 . . . . such 
that every pair of consecutive vertices in P ± are connected 
by the hyperedge e appearing between them along P ± . As 
previously discussed, the vertices of the technology graph 
represent renewable resources, intermediate resources and 
finished resources and the hyperedges of the technology graph 
represent transformations. Accordingly, a path P ± in the 

20 technology graph H = (V, E) from a founder set to a finished 
resource identifies the renewable resources, the intermediate 
resources, the finished resources, the transformations and 
the machines mediating the transformations of the process. 
Thus, a process is also referred to as a construction 
pathway. 

25 

The technology graph H = {V, E) also contains 
information defining a first robust constructability measure 
of a terminal resource representing a finished resource. The 
first robust constructability measure for a terminal resource 
is defined as the number of processes or construction 

30 

pathways ending at the terminal resource. Process redundancy 
for a terminal resource exists when the number of processes 
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or construction pathways in a technology graph exceeds one. 
Failures such as an interruption in the supply of a renewable 
resource or the failure of a machine cause blocks along 
construction pathways. Greater numbers of processes or 
5 construction pathways to a terminal resource indicate a 
greater probability that a failure causing blocks can be 
overcome by following an alternate construction pathway to 
avoid the blocks. Accordingly, higher values of the first 
robust cons true tability measure for a terminal resource 
10 indicate higher levels of reliability for the processes which 
produce the finished resource represented by the terminal 
resource. Further, the technology graph extends the 
traditional notion of the makespan. 

The technology graph H = (V, E) also contains 
15 information defining a second rohust cons true tability measure 
of a terminal resource representing a finished resource. The 
second robust constructability measure for a terminal 
resource is defined as the rate at which the number of 
processes or construction pathways ending at the terminal 
resource increases with the makespan of the process. For 
example, suppose a terminal resource can be constructed with 
a makespan of N time steps with no process redundancy. Since 
there is no process redundancy, a block along the only 
construction pathway will prevent production of the terminal 
resource until the cause of the block is corrected. The 
relaxation of the required makespan to N + M time steps will 
increase the number of construction pathways ending at the 
terminal resource. Accordingly, failures causing blocks can 
be overcome by following an alternate construction pathway to 
the terminal resource. In other words, while the minimum 

30 ,14.- 

possible makespan increased by M time steps, the resulting 
greater numbers of processes or construction pathways to the 
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terminal resource led to greater reliability. Thus, the 
present invention extends the notion of a makespan to include 
the concept of robust constructability . 

The additional rojbust constructability measures 
involve vertices which exist within the construction pathways 
of two or more terminal resources. These resources 
represented by these vertices are called poly- functional 
intermediate resources because two or more terminal resources 
can be constructed from them. For example, consider two 
terminal resources representing a house and a house with a 
chimney. The poly- functional intermediate resources are the 
resources represented by vertices which exist within a 
construction pathway of the house and within a construction 
pathway of the house with the chimney. Thus, if a consumer 
requests a chimney in a house after the house has been 
constructed without a chimney, the chimney can be added to 
the house by backtracking along the construction pathway of 
the house to a poly- functional intermediate resource and 
proceeding from the poly- functional intermediate resource 
along a construction pathway of the house with a chimney. 

FIG. 5 provides a flow diagram 500 for locating and 
selecting poly- functional intermediate resources for a set of 
terminal resources 501 having a cardinality greater than or 
equal to two. In step 5 04, the method determines the 
vertices which exist within the construction pathways of each 
terminal resource in the set of terminal resources 501 in the 
technology graph H = (V, E) . Execution of step 504 yields a 
set of vertices 505 for each terminal resource in the set of 
terminal resources 501. Accordingly, the number of sets of 
vertices 505 resulting from execution of step 504 is equal to 
the cardinality of the set of terminal resources 501. In 
step 506, the method performs the intersection operation on 
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the sets of vertices 505. Execution of step 506 yields the 
vertices which exist within the construction pathways of 
every terminal resource in the set of terminal resources 5 01. 
In other words, execution of step 506 yields the poly- 
functional intermediate resources 507 of the set of terminal 
resources 501. 

In step 508, the method performs a selection 
operation on the poly- functional intermediate resources 507. 
Preferably, step 508 selects the poly- functional intermediate 
resource 707 with the smallest fractional construction 
pathway distance. The fractional construction pathway 
distance of a given poly- functional intermediate resource is 
defined as the ratio of two numbers. The numerator of the 
ratio is the sum of the smallest distances from the given 
poly- functional intermediate resource to each terminal 
resource in the set of terminal resources 501. The 
denominator of the ratio is the sum of the numerator and the 
sum of the smallest distances from each resource in the 
founder set to the given poly- functional intermediate 
resource . The distance between two vertices along a 
construction pathway in the technology graph H = (V, E) is 
defined as the number of hyperedges e on the construction 
pathway between the two vertices. The smallest distance 
between two vertices in the technology graph H = {V, E) is 
the number of hyperedges e on the shortest construction 
pa thway. 

Alternatively, step 508 considers the process 
redundancy in addition to the fractional construction pathway 
distance in the selection of the poly- functional intermediate 
resource 507. This alternative selection technique first 
locates the poly- functional intermediate resource 507 having 
the smallest fractional construction pathway distance . Next, 
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the alternative technique traverses the construction pathways 
from the poly- functional intermediate resource 507 having the 
smallest fractional construction pathway distance toward the 
founder set until it reaches a poly- functional intermediate 
5 resource 507 having a sufficiently high value of process 
redundancy . 

The method of FIG. 5 for locating and selecting 
poly- functional intermediate resources for a set of terminal 
resources 501 can also be executed on different subsets of 

1Q the power set of the set of terminal resources 501 to locate 
and select poly- functional intermediate resources for 
different subsets of the set of terminal resources. 

As indicated by the preceding discussion, the 
present invention identifies and selects the poly- functional 

15 resource which leads to process redundancy to achieve 

reliability and adaptability. Specifically, the resource 
providing nodes should ensure that there is an adequate 
inventory of the selected poly- functional resource to enable 
it to adapt to failures and changes in the computing 
environment . Resource providing nodes use the technology 

20 

graphs to determine how to satisfy the sub -requirements 
(e.g., in the "needs-a" property of a resource request) of a 
particular resource request. 

Technology graphs are further explained in co- 
pending U.S. application number 09/345,441 filed July 1, 1999 

25 

and titled, "An Adaptive and Reliable System and Method for 
Operations Management", the contents of which are herein 
incorporated by reference. 



Searching for Relations 

30 

In one embodiment, the method of the present 
invention for allocating resources searches for relations 
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between the resource requests 110, 112, 114, 116 and the 
resource offers 130, 132, 134, 136 using an active search. 
An exemplary active search includes an ants based 
reinforcement learning algorithm as described in co-pending 
5 patent application No. 09/368,215, filed 8/04/1999, and 

titled, A Method and System for Controlling Network Traffic, 
the contents of which are herein incorporated by reference. 

In another embodiment, the method of the present 
invention for allocating resources searches for relations 
10 between the resource requests 110, 112, 114, 116 and the 

resource providers 130, 132, 134, 136 uses a passive search. 
An exemplary passive search includes advertisements of the 
resource requests 110, 112, 114, 116 by clients and 
advertisements of the resource offers 130, 132, 134, 136 by 
resource providers. 

Market-Based Resource Allocation 

FIG. 6 displays a flow diagram of a method 600 for 
allocating resources using a market-based scheme which 
executes on a resource providing node 13 0, 132, 134, 13 6. In 
step 602, the market-based allocation method 600 receives 
resource requests 110, 112, 114, 116. In step 604, the 
method 600 receives bids for resource requests 110, 112, 114, 
116 from other resource provider nodes 12 0, 122, 124, 126, 
128 . 

25 

In the context of the market-based allocation 
method 60 0, the resource requests 110, 112, 114, 116 include 
a contract to pay a specified reward for a satisfaction of 
the request. The resource requests 110, 112, 114, 116 
further include a specified quality of service. In the 

30 

preferred embodiment, the resource requests 110, 112, 114, 
116 also includes a specified reward that varies with a 



20 
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delivered quality of service in comparison with the specified 
quality of service. Preferably, the specified quality of 
service includes a time for the satisfaction of the resource 
request . 

The contracted amount is paid in full only if the 
resource request 110, 112, 114, 116, is satisfied in 
accordance with its specified quality of service. 
Preferably, a portion of the contracted amount is paid to the 
resource providing node 120, 122, 124, 126, 128 that 
satisfied the resource request 110, 112, 114, 116 if the 
satisfaction was outside the specified quality of service. 
This portion is determined as a function of the received 
quality of service. Preferably, less cash is released for 
resource requests 110, 112, 114, 116 that are satisfied long 
after the specified time quality of service. Market-arbiter 
software calculates the cash reward earned by the satisfying 
resource providing node 120, 122, 124, 126, 128 and the 
amount owed by the client that transmitted the resource 
request 110, 112, 114, 116. These rewards and bills are 
accumulated over time and sent out at a low frequency so as 
to impose only a negligible communication load on the system 
100 . 

The bids for resource requests 110, 112, 114, 116 
from other resource providing nodes 120, 122, 124, 126 
include a price that will be paid for the resource request. 
Optionally, the bids for resource requests 110, 112, 114, 116 
could also include an expiration time or a margin. In the 
preferred embodiment, the bids for resource requests 110, 
112, 114, 116 includes a satisfaction profile defining a 
satisfaction of trading the resource request as a probability 
density function of one or more parameters. Exemplary 
parameters include a quality of service. Satisfaction 
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profiles are described in co-pending U.S. application number 
09/345,441, filed July 1, 1999, titled, "An Adaptive and 
Reliable System and Method for Operations Management", the 
contents of which are herein incorporated by reference. 

In step 606, the method 600 computes an expected 
return for processing the resource request 110, 112, 114, 
116. In step 608, the method 600 processes the resource 
request 110, 112, 114, 116 to optimize the expected return. 
Exemplary processing options include satisfying the resource 
request 110, 112, 114, 116, in step 610 or selling the 
resource request 110, 112, 114, 116 to another resource 
processing node 130, 132, 134, 136 in step 612. 

In step 614, the method 600 transmits bids for the 
resource requests 110, 112, 114, 116 to other resource 
provider nodes 130, 132, 134, 136. 

Each resource providing node 13 0, 132, 134, 13 6 
acts autonomously to optimize the value of some function 
combining its own expected return and that of some (zero or 
more) selected neighbors (not necessarily immediate 
topological neighbors) as explained more fully below. The 
expected return is based on its earnings from satisfying or 
selling resources requests 110, 112, 114, 116. Optionally, 
resource providing nodes 130, 132, 134, 136 learn to optimize 
their expected return using reinforcement learning. An 
exemplary reinforcement learning technique is Q-learning. 

Resource providing nodes 120, 122, 124, 126, 128 
receive feedback about their performance. This feedback is 
called a reward. However, in the reinforcement learning 
framework of the present invention, a resource providing 
nodes 120, 122, 124, 126, 128 does not merely act to optimize 
its immediate reward. Instead, it acts to optimize its 
return. In the preferred embodiment, the return includes an 
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expected future reward that is discounted to present value. 
As mentioned earlier, reward is based on "earnings" in a 
resource market called the market-based reward framework. 

When reinforcement learning is used to adjust the 
behavior of resource providing nodes 120, 122, 124, 126, 128, 
instantaneous rewards are based on the actual cash profit of 
the resource providing nodes 120, 122, 124, 126, 128 and 
optionally, the cash profit of neighboring resource providing 
nodes 120, 122, 124, 126, 128 (not necessarily topological 
neighbors) over some short past time period. Optionally, in 
order to prevent resource providing nodes 12 0, 122, 124, 12 6, 
128 from charging arbitrary prices in monopoly situations, 
excess profit can be removed (taxed) from those resource 
providing nodes 120, 122, 124, 126, 128 whose long-term 
discounted expected reward exceeds a predefined target. 

Each resource providing node 120, 122, 124, 126, 
128 communicates "bids" that specify how much it will pay for 
resource requests 110,112, 114, 116 having a particular 
specified quality of service, and a specified reward to other 
resource providing nodes 12 0, 122, 124, 126, 12 8. 
Preferably, each resource providing nodes 120, 122, 124, 126, 
128 communicates the "bids" to its topologically neighboring 
agents. Bids may also have an expiration time. Bids stand 
until they expire or until the resource providing nodes 12 0, 
122, 124, 126, 128 where a bid is held receives a message 
canceling and/or replacing the bid. Optionally, other 
quality of service parameters corresponding to the quality of 
service requirements of resource requests 110, 112, 114, 116 
are included in the bids . 

Resource requests 110,112, 114, 116 that are 
received by a resource providing node 12 0, 122, 124, 12 6, 12 8 
(either from an application program at the resource providing 
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nodes 120, 122, 124, 126, 128, or from another resource 
providing nodes 120, 122, 124, 126, 128) that do not conform 
to the parameters of an existing bid (e.g., insufficient 
contract value or too many in a given time period) do not 
5 require payment. Instead, these resource requests 110,112, 
114, 116 are owned by the resource providing nodes 120, 122, 
124, 126, 128, and may be sold. 

A market-based allocation method for data routing 
is explained in co-pending international patent application, 
10 having attorney docket number 9392-025-228, filed January 28, 
2000, and titled, "A Method and System for Routing Control in 
Communication Networks and for System Control", the contents 
of which are herein incorporated by reference. 

15 Locally- cooperative local reinforcement learning 

Having all resource providing nodes 120, 122, 124, 
126, 128 attempt to optimize their local expected return will 
not always result in the discovery of the globally optimum 
configuration as explained in "At Home in the Universe" by 
Stuart Kauffman, Oxford University Press, Chapter 11 in the 
context of an NK fitness landscape, the contents of which are 
herein incorporated by reference. This result occurs because 
actions taken by one resource providing node 12 0, 122, 124, 
12 6, 12 8 affects its state and possibly changes the context 
of the reward for its neighboring resource providing nodes 
120, 122, 124, 126, 128. 

Accordingly, in the preferred embodiment the 
present invention utilizes combinations of the following 
three semi-local strategies: 

30 



20 



25 



-21 - 



DCl - 244658 1 



patches In this technique, resource providing nodes 
120, 122, 124, 126, 128 are partitioned into 
disjoint subsets called patches. The patches may 
or may not be topologically contiguous. Within a 
patch, the actions of resource providing nodes 120, 
122, 124, 126, 128 are coordinated to maximize the 
aggregate figure of merit for the entire patch. 
The size and location of patches are parameters for 
this strategy. 

p A neighborhood is defined for a resource providing 
nodes 120, 122, 124, 126, 128 such that when a 
decision is made there, figures of merit at the 
current node and at a proportion p of neighboring 
nodes are taken into account. A neighborhood need 
not consist of the immediate topological neighbors 
of the resource providing nodes 120, 122, 124, 12 6, 
128 . 

tau Only a fraction (called tau) of the resource 
providing nodes 120, 122, 124, 126, 128 make 
decisions that change the portions of their state 
that affect the reward of other resource providing 
nodes 120, 122, 124, 126, 128 at the same time. 
FIG. 7 provides a flow diagram 700 for determining 
optimal values of parameters of methods performing resource 
allocation and load balancing. In step 710, the present 
invention defines a global performance measure for the 
network. In step 72 0, the present invention defines an 
optimization algorithm having at least one parameter. 
Exemplary parameters include the size and location of 
patches, the neighborhood, p where the expected reward are 
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considered in making a decision and the fraction, tau, of the 
agents that change portions of their state that affect the 
reward of other agents. In step 730, the method 700 
constructs a landscape representation for values of the 
parameters and their associated global performance measure. 
In step 740, the method optimizes over the landscape to 
produce optimal values for the parameters. 

In the preferred embodiment, the present invention 
uses either patches or p or both to define a modified reward 
and hence, a return, for a resource providing nodes 12 0, 122, 
124, 126, 128 in the resource allocation problem. As 
explained earlier, the figure of merit for a resource 
providing nodes 120, 122, 124, 126, 128 is its earnings in 
the market -based framework. Optionally, the present 
invention uses the tau strategy either alone, or in 
conjunction with p and "patches" to limit the opportunities 
resource providing nodes 12 0, 122, 124, 126, 128 have for 
making decisions that affect the return of other resource 
providing nodes 120, 122, 124, 126, 128. For example, the 
reward for a resource providing node 12 0, 122, 124, 126, 12 8 
is the aggregate earnings for a region of resource providing 
nodes 120, 122, 124, 126, 128 (a patch) and the bids and for 
only a fraction tau of resource providing nodes 12 0, 122, 
124, 126, 128 change at the same time. 

Preferably, the parameters for these strategies 
(the fraction p, the fraction tau and the number and 
membership of patches) are global in nature. In other words, 
the values of these parameters are the same for all resource 
providing nodes 12 0, 122, 124, 126, 128. Alternatively, the 
values of the parameters may vary among the resource 
providing nodes 120, 122, 124, 126, 128. 



-23 - 



DCl -244658 1 



Preferably, the present invention sets these parameters 
as follows: 

First, a global performance measure is defined. 
Preferably, the global performance measure is the specified 

5 quality of service in relation to the quality of service of 
the satisfied resource request 110, 112, 114,116. Second, 
the algorithm has an outer loop that varies these parameters 
in order to maximize the global performance measure in 
accordance with techniques for searching landscapes as also 

IQ described in the co-pending international patent application 
having attorney-docket number 9392-016-228, titled, "A 
System and Method for the Analysis and Prediction of Economic 
Markets", filed December 22, 1999 at the U.S. receiving 
office, the contents of which are herein incorporated by 
reference. 

Preferably, each value of the global parameters 
governing p, patches, tau, and reinforcement learning 
features, defines a point in the global parameter space. 
With respect to this point, the method for allocating 
resources of the present invention achieves a given global 

20 

fitness. The distribution of global fitness values over the 
global parameter space constitutes a "fitness landscape" for 
the entire bandwidth- agent system. Such landscapes typically 
have many peaks of high fitness, and statistical features 
such as correlation lengths and other features as described 

25 

in co-pending international patent application number 
PCT/US99/19916 , titled, "A Method for Optimal Search on a 
Technology Landscape", the contents of which are herein 
incorporated by reference. In the preferred embodiment, 
these features are used to optimize an evolutionary search in 

30 

the global parameter space to achieve values of p, patches, 
tau, and the internal parameters of the reinforcement 
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learning algorithm that optimize the learning performance of 
the resource allocation system in a stationary environment 
with respect to load and other use factor distribution. 
Preferably, the same search procedures are also used to 
persistently tune the global parameters of the resource 
allocation system in a non- stationary environment with 
respect to load and other use factor distributions. 

By tuning of the global parameters to optimize learning, 
the present invention is "self calibrating". In other 
words, the invention includes an outer loop in its learning 
procedure to optimize learning itself, where co-evolutionary 
learning is in turn controlled by combinations of p, patches, 
and tau, plus features of the reinforcement learning 
algorithm. The inclusion of features of fitness landscapes 
aids optimal search in this outer loop for global parameter 
values that themselves optimize learning by the resource 
allocation system in stationary and non- stationary 
environments . 

Use of p, tau, or patches aids adaptive search on rugged 
landscapes because, each by itself, causes the evolving 
system to ignore some of the constraints some of the time. 
Judicious balancing of ignoring some of the constraints some 
of the time with search over the landscape optimizes the 
balance between "exploitation" and "exploration" . In 
particular, without the capacity to ignore some of the 
constraints some of the time, adaptive systems tend to become 
trapped on local, very sub-optimal peaks. The capacity to 
ignore some of the constraints some of the time allows the 
total adapting system to escape badly sub-optimal peaks on 
the fitness landscape and thereby, enables further searching. 
In the preferred embodiment, the present invention tunes p, 
tau, or patches either alone or in conjunction with one 
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another to find the proper balance between stubborn 
exploitation hill climbing and wider exploration search. 

The optimal character of either tan alone or patches 
alone, is such that the total adaptive system is poised 

5 slightly in the ordered regime, near a phase transition 

between order and chaos. See e.g. "At Home in the Universe" 
by Kauffman, Chapters 1,4, 5 and 11, the contents of which 
are herein incorporated by reference and "The Origins of 
Order, Stuart Kauffman, Oxford University Press, 1993, 

10 Chapters 5 and 6, the contents of which are herein 

incorporated by reference. For the p parameter alone, the 
optimal value of p is not associated with a phase transition. 

Without limitation, the embodiments of the present 
invention are described in the illustrative context of a 
solution using tau, p, and patches. However, it will be 
apparent to persons of ordinary skill in the art that other 
techniques that ignore some of the constraints some of the 
time could be used to embody the aspect of the present 
invention which includes defining an algorithm having one or 
more parameters, defining a global performance measure, 

20 constructing a landscape representation for values of the 

parameters and their associated global performance value, and 
optimizing over the landscape to determine optimal values for 
the parameters. Other exemplary techniques that ignore some 
of the constraints some of the time include simulated 

25 annealing, or optimization at a fixed temperature. In 

general, the present invention employs the union of any of 
these means to ignore some of the constraints some of the 
time together with reinforcement learning to achieve good 
problem optimization. 

3Q Further, there are local characteristics in the 

adapting system itself that can be used to test locally that 
the system is optimizing well. In particular, with patches 
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alone and tau alone, the optimal values of these parameters 
for adaptation are associated with a power law distribution 
of small and large avalanches of changes in the system as 
changes introduced at one point to improve the system unleash 
5 a cascade of changes at nearby points in the system. The 

present invention includes the use of local diagnostics such 
as a power law distribution of avalanches of change, which 
are measured either in terms of the size of the avalanches, 
or in terms of the duration of persistent changes at any 
10 single site in the network. 

The present invention's use of any combination of 
the above strategies, together with reinforcement learning in 
any of its versions, give it an advantage over prior art 
resource allocation methods because these strategies address 
15 many problems that could arise including the following: 

slow convergence to optimal allocation patterns, 

oscillation of network load, and 

locally beneficial but globally harmful routing 

patterns . 

Without limitation, the embodiments of the present 
20 . 
invention have been described in the illustrative context of 

a method for allocating resources. However, it is apparent 

to persons of ordinary skill in the art that other contexts 

could be used to embody the aspect of the present invention 

which includes defining an algorithm having one or more 

25 parameters, defining a global performance measure, 

constructing a landscape representation for values of the 

parameters and their associated global performance value, and 

optimizing over the landscape to determine optimal values for 

the parameters . 

3 q For example, the present invention could be used 

for operations management as explained in co-pending U.S. 
patent application No. 09/345,441, titled, "An Adaptive and 
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Reliable System and Method for Operations management" and 
filed on July 1, 1999, the contents of which are herein 
incorporated by reference. That patent application describes 
a model of an enterprise in its competitive environment, 
based on technology graphs that support a nodes and flow 
model of an organization, plus a management structure. The 
present invention, using agents to represent resources and 
operations in the enterprise model, coupled to reinforcement 
learning, p, patches and tau, is used advantageously to 
create a model of a learning organization that learns how to 
adapt well in its local environment. By use of the outer 
loop described above, good global parameter values for p, 
patches, tau, and the reinforcement learning algorithm are 
discovered. In turn, these values are used to help create 
homologous action patterns in the real organization. For 
example, the homologous action patters can be created by 
tuning the partitioning of the organization into patches, by 
tuning how decisions at one point in the real organization 
are taken with respect to a prospective benefit of a fraction 
p of the other points in the organization affected by the 
first point, and by tuning what fraction, tau, of points in 
the organization should try operational and other experiments 
to improve performance . 

In addition, the distribution of contract values 
and rewards in the reinforcement algorithm can be used to 
help find good incentive structures to mediate behavior by 
human agents in the real organization to achieve the overall 
adaptive and agile performance of the real organization. 

In addition to the use of the invention to find 
good global parameters to instantiate in the real 
organization, the same invention can be used to find good 
global parameter values to utilize in the model of the 
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organization itself to use that model as a decision support 

tool, teaching tool, etc. 

Further, the present invention is also applicable 
to portfolio management, risk management, scheduling and 
routing problems, logistic problems, supply chain problems 
and other practical problems characterized by many 
interacting factors. 

FIG. 8 discloses a representative computing system 
810 in conjunction with which the embodiments of the present 
invention may be implemented and executed. Computing system 
810 may be a personal computer, workstation, or a larger 
system such as a minicomputer. However, one skilled in the 
art of computer systems will understand that the present 
invention is not limited to a particular class or model of 
computer . 

As shown in FIG. 9 , representative computing system 
810 includes a central processing unit (CPU) 812, a memory 
unit 814, one or more storage devices 816, an input device 
818, an output device 82 0, and communication interface 822. 
A system bus 824 is provided for communications between these 
elements. Computing system 810 may additionally function 
through use of an operating system such as Windows, DOS, or 
UNIX. However, one skilled in the art of computing systems 
will understand that the present invention is not limited to 
a particular configuration or operating system. 

Storage devices 816 may illustratively include one 
or more floppy or hard disk drives, CD-ROMs, DVDs, or tapes. 
Input device 818 comprises a keyboard, mouse, microphone, or 
other similar device. Output device 820 is a computer 
monitor or any other known computer output device. 
Communication interface 822 may be a modem, a network 



-29 - 



DCl - 244658 1 



interface, or other connection to external electronic 
devices, such as a serial or parallel port 

While the above invention has been described with 
5 reference to certain preferred embodiments, the scope of the 
present invention is not limited to these embodiments. One 
skill in the art may find variations of these preferred 
embodiments which, nevertheless, fall within the spirit of 
the present invention, whose scope is defined by the claims 
1Q set forth below. 
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Claims 



1. A method for resource allocation comprising 

5 the steps of : 

receiving a plurality of resource requests and a 
plurality of resource offers, said resource requests and said 
resource offers having one or more properties; and 

determining at least one relation between at least 
one of said resource requests and at least one of said 
resource offers to identify at least one matching said 
resource offer for said resource request; and 

allocating said at least one resource request with 
said at least one matching resource offer. 

15 

2. A method for allocating resources as in claim 1 
wherein said determining at least one relation step comprises 
the step of : 

searching for said at least one relation between 
said at least one of said resource requests and said at least 
one of said resource offers. 

3. A method for allocating resources as in claim 2 
wherein said searching for at least one relation step further 
comprises the step of: 

identifying matching ones of said one or more 
properties of said resource offers with said one or more 
properties of said resource requests. 

4. A method for allocating resources as in claim 3 

30 

wherein said determining at least one relation step further 
comprises the step of: 



20 



25 
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evaluating said at least one relation by 
determining how well said one or more properties of said at 
least one resource offer match with said one or more 
properties of said at least one resource request; and 

selecting said at least one relation that is 
optimal with respect to said evaluation. 



5 . A method for resource allocation as in claim 1 
wherein said one or more properties comprise at least one 
identity . 

6. A method for resource allocation as in claim 1 
wherein said one or more properties comprise at least one 
ability. 



7 . A method for resource allocation as in claim 1 
wherein said one or more properties comprise at least one 
requirement . 



8. A method for resource allocation as in claim 1 
wherein said one or more properties comprise at least one 
quality of service. 



9. A method for resource allocation as in claim 1 
wherein said plurality of resource offers comprise data 
resources . 



10. A method for resource allocation as in claim 1 
wherein said plurality of resource offers comprise software 
resources . 
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11. A method for resource allocation as in claim 1 
wherein said plurality of resource offers comprise 
computational resources. 



12 . A method for resource allocation as in claim 1 
wherein said plurality of resource offers comprise 
communication resources. 



13 . A method for resource allocation as in claim 1 
wherein said determining at least one relation step comprises 
the step of determining at least one complement relation 
between said at least one resource request and said at least 
one resource offer. 



14. A method for resource allocation as in claim 1 
wherein said determining at least one relation step comprises 
the step of determining at least one substitute relation 
between said at least one resource request and said at least 
one resource offer. 



15. A method for resource allocation as in claim 1 
further comprising the step of constructing a graph 
representing said at least one resource request, said at 
least one resource offer and said at least one relation. 



16. A method for resource allocation as in claim 
15 wherein said constructing a graph step comprises the steps 
of: 

creating one or more vertices representing said at 
least one resource offer and said at least one resource 
request ; 
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creating at least one edge representing said at 
least one relation, said at least one edge having terminals 
corresponding to said at least one resource offer and said at 
least one resource request having said at least one relation. 

17. Computer executable software code stored on a 
computer readable medium, the code for resource allocation 
the code comprising: 

code to receive a plurality of resource requests 
and a plurality of resource offers, said resource requests 
and said resource offers having one or more properties; and 

code to determine at least one relation between at 
least one of said resource requests and at least one of said 
resource offers to identify at least one matching said 
resource offer for said resource request; and 

code to allocate said at least one resource request 
with said at least one matching resource offer. 

18. A programmed component for resource allocation 
comprising at least one memory having at least one region 
storing computer executable program code and at least one 
processor for executing the program code stored in said 
memory, wherein the program code comprises: 

code to receive a plurality of resource requests 
and a plurality of resource offers, said resource requests 
and said resource offers having one or more properties; and 

code to determine at least one relation between at 
least one of said resource requests and at least one of said 
resource offers to identify at least one matching said 
resource offer for said resource request; and 

code to allocate said at least one resource request 
with said at least one matching resource offer. 
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19. A method for allocating a plurality of 
resources comprising the steps of : 

receiving information for at least one resource 

request ; 

computing an expected return for processing said 
resource request from said information; and 

processing said at least one resource request to 
optimize said expected return. 

20. A method for allocating a plurality of 
resources as in claim 19 wherein said information for said at 
least one resource request comprises a contract to pay a 
specified reward for a satisfaction of said at least one 
resource request. 

21. A method for allocating a plurality of 
resources as in claim 20 wherein said information for said at 
least one resource request further comprises a specified 
quality of service. 

22 . A method for allocating a plurality of 
resources as in claim 21 wherein said specified reward varies 
with a satisfied quality of service in comparison with said 
specified quality of service. 

23. A method for allocating a plurality of 
resources as in claim 21 wherein said specified qualify of 
service comprises a time for delivery of said satisfaction of 
said resource request . 
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24. A method for allocating a plurality of 
resources as in claim 19 wherein said information for said at 
least one resource request comprises at least one bid 
specifying a price that will be paid for said at least one 

5 request . 

25. A method for allocating a plurality of 
resources as in claim 24 wherein said at least one bid 
further comprises an expiration time. 

10 

26. A method for allocating a plurality of 
resources as in claim 24 wherein said at least one bid 
further comprises a margin. 



15 



20 



27. A method for allocating a plurality of 
resources as in claim 24 wherein said at least one bid 
further comprises a satisfaction profile defining a 
satisfaction of trading said at least one request as a 
probability density function of at least one parameter. 

28. A method for allocating a plurality of 
resources as in claim 27 wherein said at least one parameter 
of said probability density function comprises a quality of 
service . 

29. A method for allocating a plurality of 
resources as in claim 19 wherein said expected return for 
said processing of said at least one resource request is an 
expected reward discounted to present value. 

30 

30. A method for allocating a plurality of 
resources as in claim 19 further comprising the step of 



25 
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partitioning said plurality of resources into one or more 
patches . 

31. A method for allocating a plurality of 
resources as in claim 30 wherein said processing said at 
least one resource request comprises the step of optimizing 
said expected return of said patch. 

32. A method for allocating a plurality of 
resources as in claim 19 wherein said computing an expected 
return step comprises the step of: 

selecting a portion p of said plurality of 
resources; and 

computing said expected return of said selected 
portion p of said plurality of resources. 

33 . A method for allocating a plurality of 
resources as in claim 32 wherein said processing said at 
least one resource request comprises the step of optimizing 
said expected return of said selected portion p of said 
plurality of resources. 

34. A method for allocating a plurality of 
resources as in claim 19 wherein said processing said at 
least one resource request comprises the step of satisfying 
said resource request . 

35. A method for allocating a plurality of 
resources as in claim 19 wherein said processing said at 
least one resource request comprises the step of selling said 
resource request . 
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36. A method for allocating a plurality of 
resources as in claim 19 further comprising the step of 
transmitting at least one bid specifying a price that will be 
paid for said at least one resource request. 

5 

37. Computer executable software code stored on a 
computer readable medium, the code for allocating a plurality 
of resources, the code comprising: 

code to receive information for at least one 
resource request; 

code to compute an expected return for processing 
said resource request from said information; and 

code to process said at least one resource request 
to optimize said expected return. 



15 



20 



25 



30 



38. A programmed component for allocating a 
plurality of resources comprising at least one memory having 
at least one region storing computer executable program code 
and at least one processor for executing the program code 
stored in said memory, wherein the program code comprises: 

code to receive information for at least one 
resource request ; 

code to compute an expected return for processing 
said resource request from said information; and 

code to process said at least one resource request 
to optimize said expected return. 

39. A method for allocating a plurality of 
resources comprising the steps of: 

defining at least one algorithm having one or more 
parameters for allocating the resources ; 
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defining at least one global performance measure of 
said at least one algorithm; 

executing said algorithm for a plurality of 
different values of said one or more parameters to generate a 
corresponding plurality of values for said global performance 
measure ; 

constructing a fitness landscape from said values 
of said parameters and said corresponding values of said 
global performance measure; and 

optimizing over said fitness landscape to generate 
optimal values for said at least one parameter. 

40. A method for allocating a plurality of resources as 
in claim 39 wherein said defining at least one algorithm step 
comprises the steps of : 

communicating information for at least one resource 

request ; 

computing an expected return for processing said 
resource request from said information; and 

processing said at least one resource request to 
optimize said expected return. 

41. A method for allocating a plurality of 
resources as in claim 40 wherein said at least one parameter 
comprises a proportion p of said plurality of resources. 

42. A method for allocating a plurality of 
resources as in claim 41 wherein said computing an expected 
return step comprises the step of: 

computing said expected return of said proportion p 
of said one or more software agents. 



-39 - 



DCl -244658 1 



43. A method for allocating a plurality of 
resources as in claim 40 wherein said at least one parameter 
comprises a size of one or more patches of said plurality of 
resources and a location of said patches. 



44. A method for allocating a plurality of 
resources as in claim 43 wherein said processing said at 
least one resource request step comprises the step of: 

optimizing said expected return of said patch. 

45. A method for allocating a plurality of 
resources as in claim 40 wherein said at least one parameter 
comprises a fraction, tau, of said plurality of resources. 



46. A method for allocating a plurality of 
resources as in claim 45 wherein only said fraction, tau, of 
said plurality of resources communication information for 
said at least one resource request at one time. 



47. Computer executable software code stored on a 
computer readable medium, the code allocating a plurality of 
resources, the code comprising: 

code to define at least one algorithm having one or 
more parameters for allocating the resources; 

code to define at least one global performance 
measure of said at least one algorithm; 

code to execute said algorithm for a plurality of 
different values of said one or more parameters to generate a 
corresponding plurality of values for said global performance 
measure ; 
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code to construct a fitness landscape from said 
values of said parameters and said corresponding values of 
said global performance measure; and 

code to optimize over said fitness landscape to 
5 generate optimal values for said at least one parameter. 



48 . A programmed component for allocating a 
plurality of resources comprising at least one memory having 
at least one region storing computer executable program code 
20 and at least one processor for executing the program code 
stored in said memory, wherein the program code comprises : 

code to define at least one algorithm having one or 
more parameters for allocating the resources; 

code to define at least one global performance 
25 measure of said at least one algorithm; 

code to execute said algorithm for a plurality of 
different values of said one or more parameters to generate a 
corresponding plurality of values for said global performance 
measure ; 

code to construct a fitness landscape from said 
values of said parameters and said corresponding values of 
said global performance measure; and 

code to optimize over said fitness landscape to 
generate optimal values for said at least one parameter. 

25 



30 



-41 - 



DCl - 244658 1 



Abstract 

5 The present invention relates generally to a method 

and system for resource allocation. More particularly, the 
present invention allocates resources using technology 
graphs, passive and active searching, reinforcement learning, 
market driven decision making, reinforcement learning as well 

10 as p, tau, and patches techniques. 
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